Refactoring Corpora
نویسندگان
چکیده
We describe a pilot project in semiautomatically refactoring a biomedical corpus. The total time expended was just over three person-weeks, suggesting that this is a cost-efficient process. The refactored corpus is available for download at http://bionlp.sourceforge.net.
منابع مشابه
Corpus Refactoring: a Feasibility Study
BACKGROUND Most biomedical corpora have not been used outside of the lab that created them, despite the fact that the availability of the gold-standard evaluation data that they provide is one of the rate-limiting factors for the progress of biomedical text mining. Data suggest that one major factor affecting the use of a corpus outside of its home laboratory is the format in which it is distri...
متن کاملUC3M System: Determining the Extent, Type and Value of Time Expressions in TempEval-2
This paper describes the participation of Universidad Carlos III de Madrid in Task A of the TempEval-2 evaluation. The UC3M system was originally developed for the temporal expressions recognition and normalization (TERN task) in Spanish texts, according to the TIDES standard. Current version supposes an almost-total refactoring of the earliest system. Additionally, it has been adapted to the T...
متن کاملReengineering Linguistic Resources for Machine Translation in Medical Applications
We discuss some key methodological and operational aspects related to the design and development of a machine translation (MT) prototype which can be integrated in healthcare information systems. We first describe the approach adopted for collecting, formating, sampling and analyzing multilingual corpora of diagnostic expressions. The resulting generic language representation model is then pres...
متن کاملAssessing the Quality of Refactoring Patterns for Introducing Design Patterns
Refactoring is a well-known process to improve the code design of object-oriented programs. Recently, several literatures focus on refactoring with introducing design patterns that are general repeated solutions to common problems in software design. For making it easy to perform such refactoring, a lot of refactoring patterns are proposed. Each refactoring pattern includes a description of ref...
متن کاملAssisting Refactoring Tool Development through Refactoring Characterization
Tool support for refactoring is widespread nowadays. The most widely known IDEs include refactoring support, and many refactoring-specific tools are also available. Developers are aware of refactoring activities and they do refactor their applications even manually or in an assisted way. For the users of refactoring tools, the current state of the art is well documented in refactoring catalogs,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006